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BACKGROUND OF THE INVENTION 
Field of the Invention: 

The present invention relates generally to a preprocessing device and method 
15 for recognizing image characters, and in particular, to a device and method for 
extending a character region in an image. 

Description of the Related Art: 

Generally, a preprocessing operation is performed to recognize image 
20 characters. The "preprocessing operation" refers to an operation of processing an image 
before recognition of the characters in the image. The image preprocessing operation 
can include an operation of deciding whether or not an input image is appropriate for 
character recognition, an operation of correcting the skew of an object in an input 
image, an operation of properly correcting the size of an input image, and an operation 
25 of binarizing an image signal (i.e., transforming an image function into a binary image) 
so that characters of the image signal can be recognized. 

A device for recognizing image characters generally recognizes characters 
from an image. The image is divided into a character region and a background region, 
30 and no character is arranged in the background region. For example, assuming that a 
document to be subject to character recognition is a business card, an input image 
becomes an image of the business card. The input image includes a background region 



outside the business card. In this case, it is possible to improve character recognition 
performance by extending the size of the image after removing the background region 
from the image. In addition, it is generally the case that no character region is included 
on the edges of the business card. Therefore, it is possible to improve recognition 
5 performance by searching for a position of a character region in a business card, 
removing regions other than the character region according to the search results, and 
then extending the character region by a percentage of the removed regions. Storing 
such preprocessed image contributes to in an increase in memory efficiency. 

1 0 SUMMARY OF THE INVENTION 

It is, therefore, an object of the present invention to provide a device and 
method for removing a background region from an image and then extending the 
character region in an image signal processing device. 

15 

It is another object of the present invention to provide a device and method for 
searching for a position of a character region in an image and removing regions outside 
the character region in an image signal recognition device. 

20 It is fiirther another object of the present invention to provide a device and 

method for searching for a position of a character region in an image, removing regions 
outside the character region, and then extending the character region in an image signal 
recognition device. 

25 In accordance with one embodiment of the present invention, there is provided 

a device for extending a character region in an image. The device comprises an input 
part for receiving an input image, a block classification part for classifying the input 
image into character blocks and background blocks, and converting pixels in the 
character blocks into pixels having a first brightness value and pixels in the background 

30 blocks into pixels having a second brightness value, and a position search part for 
searching for left, right, top and bottom positions of a character region by horizontally 
and vertically scanning the block-classified image, and determining a position of the 



character region. The device for extending a character region in an image further 
comprises a region of contents (ROC) extraction part for extracting an image in the 
determined position of the character region from the input image, and an ROC extension 
part for extending the extracted image of the character region to a size of the input 
5 image. 

In accordance with another embodiment of the present invention, there is 
provided a device for extending a character region in an image. The device comprises 
an input part for receiving an input image, a block classification part for classifying the 

10 input image into character blocks and background blocks, and converting pixels in the 
character blocks into pixels having a first brightness value and pixels in the backgroimd 
blocks into pixels having a second brightness value, a median filter for performing 
median filtering on an image output from the block classification part to remove blocks 
erroneously classified as character blocks and a position search part for searching for 

15 left, right, top and bottom positions of a character region by horizontally and vertically 
scanning the median- filtered image, and determining a position of the character region. 
The device for extending a character region in an image further comprises an ROC 
extraction part for extracting an image in the determined position of the character region 
from the input image, and an ROC extension part for extending the extracted image of 

20 the character region to a size of the input image. 

In accordance with a further embodiment of the present invention, there is 
provided a device for extending a character region in an image. The device comprises 
an input part for receiving an input image, a mean filter for performing mean filtering 

25 on the input image to blur the input image, a clock classification part for classifying the 
mean-filtered image into character blocks and background blocks, and converting pixels 
in the background blocks into pixels having a first brightness value and pixels in the 
background blocks into pixels having a second brightness value, and a median filter for 
performing median filtering on an image output from the block classification part to 

30 remove blocks erroneously classified as character blocks. The device for extending a 
character region in an image further comprises a position search part for searching for 
left, right, top and bottom positions of a character region by horizontally and vertically 
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scanning the median-filtered image, and detemiining a position of the character region, 
an ROC extraction part for extracting an image in the determined position of the 
character region from the input image, and an ROC extension part for extending the 
extracted image of the character region to a size of the input image. 

5 

In accordance with a further embodiment of the present invention, there is 
provided a device for extending a character region in an image. The device comprises 
an input part for receiving an input image, a mean filter for performing mean filtering 
on the input image to blur the input image, a block classification part for classifying the 

10 mean-filtered image into character blocks and background blocks, and converting pixels 
in the character blocks into pixels having a first brightness value and pixels in the 
background blocks into pixels having a second brightness value, and a subsampling part 
for subsampling pixels in the image output from the block classification part to reduce 
the number of the pixels. The device for extending a character region in an image 

15 further comprises a median filter for performing median filtering on the subsampled 
image to remove blocks erroneously classified as character blocks, an interpolation part 
for performing interpolating on the median-filtered image to extend the median-filtered 
image to a size of the input image, and a position search part for searching for left, right, 
top and bottom positions of a character region by horizontally and vertically scanning 

20 the block-classified image, and determining a position of the character region. The 
device for extending a character region in an image still fiirther comprises an ROC 
extraction part for extracting an image in the determined position of the character region 
fi-om the input image, and an ROC extension part for extending the extracted image of 
the character region to a size of the input image. 

25 

In accordance with still another embodiment of the present invention, there is 
provided a method for extending a character region in an image. The method comprises 
the steps of receiving an input image, classifying the input image into character blocks 
and background blocks, and converting pixels in the character blocks into pixels having 
30 a first brightness value and pixels in the background blocks into pixels having a second 
brightness value, and searching for left, right, top and bottom positions of a character 
region by horizontally and vertically scanning the block-classified image, and 
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determining a position of the character region; extracting an image in the determined 
position of the character region from the input image. The method for extending a 
character region in an image further comprises extending the extracted image of the 
character region to a size of the input image. 

5 

In accordance with still another embodiment of the present invention, there is 
provided a method for extending a character region in an image. The method comprises 
the steps of receiving an input image, classifying the input image into character blocks 
and background blocks, and converting pixels in the character blocks into pixels having 

10 a first brightness value and pixels in the background blocks into pixels having a second 
brightness value, performing median filtering on the block-classified image to remove 
blocks erroneously classified as character blocks, and searching for left, right, top and 
bottom positions of a character region by horizontally and vertically scanning the 
median-filtered image, and determining a position of the character region. The method 

15 for extending a character region in an image further comprises extracting an image in 
the determined position of the character region from the input image, and extending the 
extracted image of the character region to a size of the input image. 

In accordance with still another embodiment of the present invention, there is 
20 provided a method for extending a character region in an image. The method comprises 
the steps of receiving an input image, performing mean filtering on the input image to 
blur the input image, classifying the mean-filtered image into character blocks and 
background blocks, and converting pixels in the backgroimd blocks into pixels having a 
first brightness value and pixels in the background blocks into pixels having a second 
25 brightness value, and performing median filtering on the block-classified image to 
remove blocks erroneously classified as character blocks. The method for extending a 
character region in an image further comprises searching for left, right, top and bottom 
positions of a character region by horizontally and vertically scanning the median- 
filtered image, and determining a position of the character region, extracting an image 
30 in the determined position of the character region from the input image, and extending 
the extracted image of the character region to a size of the input image. 
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In accordance with still another embodiment of the present invention, there is 
provided a method for extending a character region in an image. The method comprises 
the steps of receiving an input image, performing mean filtering on the input image to 
blur the input image, classifying the mean-filtered image into character blocks and 
5 background blocks, and converting pixels in the character blocks into pixels having a 
first brightness value and pixels in the backgroxmd blocks into pixels having a second 
brightness value, and subsampling pixels in the block-classified image to reduce the 
number of the pixels. The method for extending a character region in an image further 
comprises performing median filtering on the subsampled image to remove blocks 

10 erroneously classified as character blocks, performing interpolating on the median- 
filtered image to extend the median-filtered image to a size of the input image, 
searching for left, right, top and bottom positions of a character region by horizontally 
and vertically scanning the block-classified image, and determining a position of the 
character region, extracting an image in the determined position of the character region 

1 5 fi-om the input image, and extending the extracted image of the character region to a size 
of the input image. 

BRIEF DESCRIPTION OF THE DRAWINGS 

20 The above and other objects, features and advantages of the present invention 

will become more apparent fi-om the following detailed description when taken in 

conjunction with the accompanying drawings in which: 

FIG 1 is a block diagram illustrating a structure of a device for extending a 

character region in an image according to a first embodiment of the present invention; 
25 FIG 2 is a block diagram illustrating a structure of a device for extending a 

character region in an image according to a second embodiment of the present 

invention; 

FIG 3 is a block diagram illustrating a detailed structure of the block 
classification part of FIGs. 1 and 2 in accordance with an embodiment of the present 
30 invention; 

FIG 4A to FIG 4C are diagrams illustrating a characteristic for determining a 
sum of absolute values of DCT coefficients in each block; 
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FIG 5 is a flowchart illustrating a method for extending a character region in 
an image according to a first embodiment of the present invention; 

FIG 6 is a flowchart illustrating a method for extending a character region in 
an image according to a second embodiment of the present invention; 
5 FIG 7 is a flowchart illustrating a detailed method of the block classification 

process of FIGs. 5 and 6 in accordance with an embodiment of the present invention; 

FIG 8 is a flowchart illustrating a detailed method of the position search 
process of FIGs. 5 and 6 in accordance with an embodiment of the present invention; 

FIG 9 is a flowchart illustrating a method for extending a character region in 
10 an image according to an embodiment of the present invention; and 

FIGs. lOA and lOH are diagrams illustrating images generated in the procedure 
of FIG 9. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

15 

In the following description, specific details such as the size of an image and 
sizes of character and background blocks are provided for a better understanding of the 
present invention. It would be obvious to those skilled in the art that the invention can 
be easily implemented without such specific details or by modifying the same. 

20 

In embodiments of the present invention, an input image is assumed to have a 
size of 640x480 pixels. The term "block" means character and background blocks, and 
it is assumed herein that each of the blocks has a size of 8x8 pixels. In addition, the term 
"outside region" refers to unwanted regions other than a character region in the image. 

25 

Preferred embodiments of the present invention will now be described in detail 
with reference to the annexed drawings. 

FIG 1 is a block diagram illustrating a structure of a device for extending a 
30 character region in an image according to a first embodiment of the present invention. 
Referring to FIG. 1, input part 110 has the fiinction of receiving an input image. Input 
part 1 1 0 can be a camera, scanner, a communication interface including a modem and a 
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network, or a computer, as well as other devices.. It is assumed herein the input image is 
comprised of 640 (column) x 480 (row) pixels. 

Block classification part 120 divides the input image received from the input 
5 part 110 into blocks having a preset block size, and classifies the divided blocks into 
character blocks and background blocks by analyzing pixels included in the divided 
blocks. The block classification part 120 then converts pixels in the classified character 
blocks into pixels having a specific value. 

Median filter 130 performs median filtering on an image output from the block 
10 classification part 120 to remove erroneously classified character regions that are 
actually edges or image noise. After the image is subject to block classification, it can 
include isolated character blocks generated by edges or noises. The median filter 130 
has the function of removing the erroneously classified character blocks (isolated 

character blocks) created in the block classification process by edges or image noise. 

c 

15 Position search part 140 horizontally and vertically scans the median-filtered 

image and searches for a position of the character region. The position search part 140 
horizontally scans the median-filtered image and searches for a point xl at the leftmost 
character block and a point x2 at the rightmost character block. The position search part 
140 also vertically scans the median-filtered image, and searches for a point yl at the 

20 topmost character block and a point y2 at the bottommost character block. A position of 
the character region in the image is determined according to a result of the search. In 
this case, the left top and right bottom points of the character region are (xl, yl) and 
(x2, y2). The left top and right bottom points (xl, yl) and (x2, y2) of the character 
region are based on the aspect ratio of the input image, such that distortion of the image 

25 can be prevented when an region-of-contents (ROC) extension part 160 extends the 
image. 

An ROC extraction part 150 extracts an image of the character region searched 
by the position search part 140. The ROC extraction part 150 receives information 
30 associated with the left top and right bottom points (xl , yl) and (x2, y2) of the character 
region searched by the position search part 140, and extracts an image located between 
the left top and right bottom points (xl, yl) and (x2, y2) of the character region from the 



input image output from the input part 110. Accordingly, an image output from the 
ROC extraction part 150 becomes an image of the character region in which the 
background region is removed from the input image. 

5 The ROC extension part 160 extends the image of the extracted character 

region to the size of the input image. The image extension can be implemented by 
interpolation. In an exemplary embodiment of the present invention, the image 
extension can be implemented by bilinear interpolation, although, as those skilled in the 
art can appreciate, other methods can also be used to perform the interpolation. The 
10 image extension is achieved by the interpolation operation so that the size of the image 
of the extracted character region can be equal to that of the input image. 

Recognition part 170 accesses the extended image and recognizes characters 
from the accessed image. 

15 

Operation of the device for extending a character region according to the first 
embodiment of the present invention will now be described in detail. First, the block 
classification part 120 divides an input image into blocks, classifies the divided blocks 
into character blocks and background blocks, and converts the classified character 

20 blocks into pixels having a first brightness value and the classified background blocks 
into pixels having a second brightness value (binarization). The reason that the block 
classification part 120 classifies the blocks into character blocks and background blocks 
and then fills the classified character blocks and background blocks with pixels having 
diflFerent brightness values is to display character regions of the image. As mentioned 

25 above, it is assumed that each of the blocks has a size of 8x8 pixels. Following block 
classification, the median filter 130 then performs median filtering on the image output 
from the block classification part 120, to remove erroneously classified character blocks 
of the image. The median filter 130 performs the fiinction of removing isolated 
character blocks erroneously classified as character blocks by image noise in the block 

30 classification process. 

The position search part 140 searches for a position of a character region by 



horizontally and vertically scanning the median-filtered image. The position search part 
140 searches for a point xl at the leftmost character block and a point x2 at the 
rightmost character block by horizontally scanning the median-filtered image, and 
stores the result values. Further, the position search part 140 searches for a point yl at 
5 the topmost character block and a point y2 at the bottommost character block by 
vertically scanning the median-filtered image, and stores the result values. Thereafter, 
the position search part 140 determines left top and right bottom points (xl, yl) and (x2, 
y2) of the character region according to the search results. The left top and right bottom 
points (xl, yl) and (x2, y2) of the character region are determined based on the aspect 

10 ratio of the input image, such that distortion of the image can be substantially reduced or 
eliminated when the ROC extension part 160 extends the image. In the embodiment of 
the present invention, the left top (xl, yl) and right bottom (x2, y2) points of the 
character region are determined so that the ratio of width to length associated with the 
character region searched by the position search part 140 is 4:3 since the ratio of width 

15 to length associated with the input image is 4:3 (i.e., 640:480 pixels),. 

The ROC extraction part 150 extracts the image of the character region 
searched by the position search part 140. The ROC extraction part 150 receives 
information associated with the left top (xl, yl) and right bottom (x2, y2) points of the 

20 character region searched by the position search part 140, and extracts an image located 
between the left top and right bottom points (xl, yl) and (x2, y2) of the character region 
fi-om the input image output fi-om the input part 110. On the basis of the left top and 
right bottom points (xl, yl) and (x2, y2) of the character region, the ROC extraction 
part 150 extracts, as character region pixels, pixels between the point xl and the point 

25 x2 in the horizontal direction and pixels between the point yl and the point y2 in the 
vertical direction fi-om the input image. An output image fi-om the ROC extraction part 
150 becomes an image of the character region in which a background region is removed 
from the input image. 

30 The ROC extension part 160 extends the image of the extracted character 

region to the size of the input image. Image extension can be implemented by 
interpolation. It is assumed herein that the image extension is implemented by bilinear 
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inteqjolation, though, as those skilled in the art can appreciate, other methods can also 
be used to perform the interpolation. The image extension is achieved by the 
interpolation operation so that the size of the image of the extracted character region 
becomes equal to that of the input image. 

5 

The recognition part 170 receives the extended image output from the ROC 
extension part 160 and recognizes characters from the received image. Although the 
first embodiment of the present invention has been described wherein the proposed 
character region extension device serves as a preprocessor of a recognizer, the proposed 
10 character region extension device can be used as a device for editing an image and 
storing the edited image in 'an image processing device. 

FIG 2 is a block diagram illustrating a structure of a device for extending a 
character region in an image according to a second embodiment of the present 
15 invention. Referring to FIG. 2, input part 110 has the function of receiving an input 
image. Input part 110 can be a camera, scanner, a communication interface including a 
modem and a network, or a computer, as well as other devices.. It is assumed herein the 
input image is comprised of 640 (column) x 480 (row) pixels. 

20 Mean filter 180 performs mean filtering on the input image and makes a 

blurred image. The mean filtering is performed to reduce the influence of the 
background region outside the character region in the block classification process that 
follo^ys, by blurring the input image. 

25 Block classification part 120 divides an image output from the mean filter 180 

into blocks, analyzes pixels included in the divided blocks, classifies the blocks into 
character blocks and background blocks, and converts pixels in the character blocks into 
pixels having a specified value. The block classification part 120 classifies the blocks 
into character blocks and background blocks in order to extract a character region by 

30 converting the pixels in the character blocks into pixels having a specified value. Here, 
it is assumed that each of the blocks consists of 8 x 8 pixels. 



Subsampling part 190 subsamples an output image from the block 
classification part 120 to reduce the number of image pixels. The subsampling part 190 
reduces the number of image pixels in order to increase the filtering rate by decreasing 
the filter window in the following median filtering process. In the second embodiment 
5 of the present invention, it is assumed that the pixel reduction ratio is (2:1)^. In this 
case, the subsampling part 190 performs 2:1 subsampling on horizontal pixels and 
performs 2:1 subsampling on vertical pixels, such that the number of image pixels in the 
mage is reduced to Va, of the original value. 

10 Median filter 130 performs median filtering on the image output from the 

subsampling part 190, and removes erroneously classified character blocks from the 
input image. The median filter 130 performs the function of removing the isolated 
character blocks erroneously classified as character blocks due to image noise in the 
block classification process. 

15 

Interpolation part 195 performs interpolation on pixels in the image output 
from the median filter 130 to extend the image. In the second embodiment of the present 
invention, it is assumed that the interpolation ratio is (2:1)^. Interpolation part 195 
performs a 2:1 interpolation on horizontal and vertical pixels of the output image from 
20 the median filter 130 to extend the image four times. The interpolation operation is 
performed in order to search for the correct position of the character region and to 
extend the size of the image reduced by the subsampling process to that of the original 
image. 

25 Position search part 140 horizontally and vertically scans the median-filtered 

image and searches for a position of the character region. The position search part 140 
horizontally scans the median-filtered image and searches for a point xl at the leftmost 
character block and a point x2 at the rightmost character block. Furthermore, the 
position search part 140 vertically scans the median-filtered image, and searches for a 

30 point yl at the topmost character block and a point y2 at the bottommost character 
block. The position of the character region in the image is determined according to the 
result of the search. The left top and right bottom points of the character region are (xl. 
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yl) and (x2, y2). The left top and right bottom points (xl, yl) and (x2, y2) of the 
character region are determined based on the aspect ratio of the input image, such that 
distortion of the image can be substantially reduced or eliminated when the following 
ROC extension part 160 extends the image. 

5 

ROC extraction part 150 extracts the image of the character region searched by 
the position search part 140. The ROC extraction part 150 receives information 
associated with the left top and right bottom points (xl, yl) and (x2, y2) of the character 
region searched by the position search part 140, and extracts an image located between 
10 the left top and right bottom points (xl , yl) and (x2, y2) of the character region from the 
input image output from the input part 110. Accordingly, the image output from the 
ROC extraction part 150 becomes an image of the character region in which the 
background region is removed from the input image. 

15 The ROC extension part 160 extends the image of the extracted character 

region to the size of the input image. The image extension can be implemented by 
interpolation. It is assumed herein that the image extension is implemented by bilinear 
interpolation, though, as those skilled in the art can appreciate, other methods of 
interpolation can also be used. The image extension is achieved by the interpolation 

20 operation so that the size of the image of the extracted character region can be equal to 
that of the input image. 

Recognition part 170 accesses the extended image and recognizes characters 
from the accessed image. 

25 

Operation of the character region extension device according to the second 
embodiment of the present invention will now be described in detail. First, the input part 
110 receives an image having a size of N x M pixels. As mentioned above, it is assumed 
herein that the image has a size of 640 (N) x 480 (M) pixels. The input image can be a 
30 color image or a grayscale image not having color information. In the second 
embodiment of the present invention, it is assumed that the image is a grayscale image. 
The input part 110 for receiving the image can be a camera, a scaimer, a communication 
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interface including a modem and a network, a computer, or another type of device that 
can provide an image. The input part 110 outputs the input image to the mean filter 180, 
that performs mean filtering on the input image, and makes a blurred image so that the 
background region outside the character region of the image does not affect the 
5 character region classification process by the block classification part 120, which 
follows the mean filter part 180. An example of such a mean filter is disclosed in a 
reference entitled "Digital Image Processing," by R. C. Gonzalez, R. Woods, et al., 2nd 
ed.. Prentice Hall, pp. 119-123, 2002, the contents of which are incorporated herein by 
reference. 

10 

The mean-filtered image is applied to the block classification part 120. The 
block classification part 120 divides the image output fi:'om the mean filter 180 into 
blocks, analyzes pixels contained in the blocks, classifies the blocks into character 
blocks and background blocks, and converts the pixels of the classified character blocks 
1 5 into pixels having a specified value. 

FIG 3 is a block diagram illustrating a detailed structure of the block 
classification part 120 in accordance with an embodiment of the present invention. 
Referring to FIG 3, an image division part 211 divides the image into blocks having a 
20 predetermined size. Here, the image consists of 640 x 480 pixels, and each of the blocks 
consists of 8 X 8 pixels. In this case, the image division part 211 divides the image into 
4800 blocks. 

The blocks output fi-om the image division part 211 are applied to a discrete 
cosine transform (DCT) conversion part 213, and the DCT conversion part 213 

25 performs a DCT conversion on the blocks. An energy calculation part 215 calculates a 
sum of absolute values of dominant DCT coefficients within the DCT-converted blocks. 
In this case, the energy distribution value of the DCT coefficients within the character 
blocks are larger than that of the DCT coefficients within the background blocks. FIG 
4A is a diagram illustrating a comparison of energy distributions of DCT coefficients for 

30 the character blocks and the background blocks. In FIG 4A, the Y axis represents an 
average of the absolute sxmis in a log scale, and the X axis represents a zigzag scan 
order of the DCT coefficients. As illustrated in FIG 4A, it can be noted that DCT 
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coefficients of the character blocks are larger in their average values than the DCT 
coefficients of the background blocks. FIG. 4B is a diagram illustrating an energy 
distribution characteristic of DCT coefficients for the character blocks, hi FIG 4B, the 
Y axis represents an average of the absolute sums in a normal scale, and the X axis 
5 represents a zigzag scan order of the DCT coefficients. As illustrated in FIG 4B, it can 
be noted that the average of absolute sums of some DCT coefficients for the character 
blocks are relatively larger. Thus, in the first and second embodiments of the present 
invention, it is assumed that the dominant DCT coefficients used in the block 
classification process are Di D9 shown in FIG 4C. Accordingly, the sum of the 
10 absolute values of the dominant DCT coefficients in a k**' block can be calculated by 

S'=t,\D^\ (1) 

1=1 

In Equation (1), |jDf| denotes an i^^ dominant DCT coefficient of the k'^ block, 

15 and 5* denotes the sum of the absolute values of the dominant DCT coefficients in the 
k^^ block. Thus, in the first and second embodiments of the present invention, the sum of 
the dominant DCT coefficients Di ~ D9 is calculated. 

The energy calculation part 215 performs a calculation of Equation (1) on all 
20 blocks (at k = 0, 1, 2, ... , 4799). Thereafter, energy values (k = 0, 1, 2, ... , 4799) 
calculated block by block are appHed to a block threshold calculation part 217. 

The block threshold calculation part 217 sums up the energy values 5* (k = 0, 
1, 2, ... , 4799) calculated block by block, and produces an average <5*> by dividing 
25 the summed energy value by the total number (TBN) of blocks. The average value <*S*) 
is produced in accordance with Equation (2) below. The average value (5* > becomes a 
block threshold Cth used for determining the blocks as character blocks or background 
blocks. 
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= Cth 



(2) 



In Equation (2), TBN denotes the total number of blocks. 

5 Classification part 219 sequentially receives the energy values (corresponding 

to sums of the absolute values of dominant DCT coefficients for the blocks) output from 
the energy calculation part 215 on a block-by-block basis. The classification part 219 
classifies each corresponding block as a character block or a backgroimd block by 
comparing the received block energy values with a block threshold Cth . The 
10 classification part 219 classifies the k^^ block as a character block (CB) if 5* > Cth and 
classifies the k^^ block as a background block (BB) if < Cth as shown in Equation 
(3). 

IFS'>Cth thenCB 

(3) 

else BB 

15 

Pixels in the character blocks classified by the classification part 219 can have 
gray levels between 0 and 255. A block filling part 221 then converts pixels of a 
character block classified by the classification part 219 into pixels having the first 
brightness value, and converts pixels of a background block into pixels having the 

20 second brightness value. In the embodiment of the present invention, it is assumed that 
the block filling part 221 converts the pixels in the character block into white pixels, and 
converts the pixels in the background block into black pixels. Thus, the block filling 
part 221 fills the character blocks of the image with the white pixels and fills the 
background blocks of the image with the black pixels. The character blocks and 

25 background blocks are filled with pixels of different brightness values after the block 
classification part 120 classifies the blocks into the character blocks and backgroimd 
blocks in order to appropriately display character regions. 

Thereafter, the subsampling part 190 subsamples the image output fi-om the 
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block classification part 120 to reduce the number of horizontal and vertical pixels. The 
subsampling part 190 reduces the number of image pixels in order to increase the 
filtering rate by decreasing the filter window in the median filtering process that 
follows. In the second embodiment of the present invention, it is assumed that the pixel 
5 reduction ratio is (2 : 1)^ , In this case, the number of pixels of the output image fi-om the 
block classification part 120 is reduced to of that of the original value. Thus, the size 
of the reduced image is decreased to 320 x 240 pixels. 

Following the subsampling performed by the subsampling part 190, the median 
10 filter 130 performs median filtering on the image output fi"om the subsampling part 190, 
and removes background blocks and erroneously classified character blocks from the 
input image. The median filter 130 performs the function of removing the isolated 
blocks erroneously classified as character blocks due to the noise in the block 
classification process. An example of such a median filter is disclosed in a reference 
1 5 entitled "Fundamental of Digital Image Processing," by A. K. Jain, Prentice Hall, pp. 
246-249, the entire contents of which are incorporated herein by reference. 

After the median filtering on the image, the interpolation part 195 performs 
interpolation on horizontal and vertical pixels of an output image firom the median filter 
20 130 to extend the image to the size of the input image. In the second embodiment of the 
present invention, it is assumed that an interpolation ratio (2:1)^. The interpolation 
operation is performed in order to search for a correct position of the character region 
and to extend the size of the image reduced by the subsampling process to that of the 
original image. 

25 

The position search part 140 horizontally and vertically scans the median- 
filtered image and searches for the position of the character region. The position search 
part 140 horizontally scans the median-filtered image, searches for a point xl at the 
leftmost character block and a point x2 at the rightmost character block, and saves the 
30 result of the search. Furthermore, the position search part 140 vertically scans the 
median-filtered image, searching for a point yl at the topmost character block and a 
point y2 at the bottommost character block, and stores the result of the search. The left 



- 17- 



top and right bottom points (xl, yl) and (x2, y2) of the character region depend upon 
the resuhs of the searches. The left top and right bottom points (xl, yl) and (x2, y2) of 
the character region are determined based on the aspect ratio of the input image, such 
that the distortion of the image can be substantially reduced or ehminated when the 
5 following ROC extension part 1 60 extends the image. In the second embodiment of the 
present invention, the left top and right bottom points (xl, yl) and (x2, y2) of the 
character region are determined so that a ratio of width to length associated with the 
character region searched by the position search part 140 becomes 4:3 since the ratio of 
width to length associated with the input image is 4:3 (i.e., 640:480 pixels).. 

10 

The ROC extraction part 150 extracts the image of the character region 
searched by the position search part 140. The ROC extraction part 150 receives 
information associated with the left top and right bottom points (xl, yl) and (x2, y2) of 
the character region searched by the position search part 140, and extracts an image 

15 located between the left top and right bottom points (xl, yl) and (x2, y2) of the 
character region from the input image output from the input part 110. On the basis of the 
left top and right bottom points (xl, yl) and (x2, y2) of the character region, the ROC 
extraction part 150 extracts, as character region pixels, pixels between the point xl and 
the point x2 in the horizontal direction and pixels between the point yl and the point y2 

20 in the vertical direction. The image output from the ROC extraction part 150 becomes 
an image of the character region in which the backgroimd region has been removed 
from the input image. 

The ROC extension part 160 extends the image of the extracted character 
25 region to the size of the input image. The image extension can be implemented by 
interpolation. In the second embodiment of the present invention, it is assumed that the 
image extension is implemented by a bilinear interpolation defined as: 

v(jc, j^) = (1 - Ajc)(1 - Ay)u{m, az) -f (1 - Ax) Ay u{m^ « 4- 1) 
+ Ax{\ - Ay)u{m + 1, «) + AxAy u{m + 1, « + 1) 

(4) 

where Ax = x — m 
Ay = y-n 
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Here, the image extension is achieved by the interpolation operation so that the 
size of the image of the extracted character region can be equal to that of the input 
image. This bilinear interpolation is disclosed in a reference entitled "Numerical 
5 Recipes in C/' by W. H. Press, S. A. Teukolsky, et al., 2nd ed., Cambridge, pp. 123-125, 
1988, the entire contents of which are incorporated herein by reference. 

The recognition part 170 receives the extended image output from the ROC 
extension part 160 and recognizes characters from the received image. Although the 
10 second embodiment of the present invention has been described with reference to an 
embodiment in which the proposed character region extension device serves as a 
preprocessor of a recognizer, the proposed character region extension device can be 
used as a device for editing an image and storing the edited image in an image 
processing device. 

15 

FIG 5 is a flowchart illustrating a method for extending a character region in 
an image according to a first embodiment of the present invention. Referring to FIG 5, 
it is determined in decision step 311 whether an input image is received. If an input 
image has been received ("Yes'' path from decision step 311), the input image is divided 

20 into blocks having a predetermined size, the divided blocks are classified into character 
blocks and background blocks, pixels in the classified character blocks are converted 
into pixels having a first brightness value, and pixels in the classified background blocks 
are converted into pixels having a second brightness value, in step 313. In step 315, the 
block-classified image is subject to median filtering to remove erroneously classified 

25 character regions from the image. The median filtering process is performed to remove 
isolated character blocks left in the image as they are erroneously classified as character 
blocks due to noise in the block classification process. 

In step 317, which follows step 315, the median-filtered image is horizontally 
30 and vertically scanned to search for a position of the character region. The position 
search part 140 horizontally scans the median-filtered image, searches for a point xl at 
the leftmost character block and a point x2 at the rightmost character block, and saves 
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the result values. The position search part 140 then vertically scans the median-filtered 
image, searches for a point yl at the topmost character block and a point y2 at the 
bottommost character block, and saves the result values. Thereafter, left top and right 
bottom points (xl, yl) and (x2, y2) of the character region in the image are determined 
5 according to the search results. In this case, the left top and right bottom points (xl, yl) 
and (x2, y2) of the character region are determined based on the aspect ratio of the input 
image, such that distortion of the image can be prevented when the following ROC 
extension part 160 extends the image. 

10 In step 319, the image of the searched character region is extracted from the 

input image. The ROC extraction part 150 receives information associated with the left 
top and right bottom points (xl, yl) and (x2, y2) of the character region searched by the 
position search part 140, and extracts the image located between the left top and right 
bottom points (xl, yl) and (x2, y2) of the character region ft*om the input image output 

15 fi*om the input part 110. On the basis of the left top and right bottom points (xl, yl) and 
(x2, y2) of the character region, the ROC extraction part 150 extracts, as an image of the 
character region, the image located between the point xl and the point x2 in the 
horizontal direction and the image located between the point yl and the point y2 in the 
vertical direction fi-om the input image. The image output fi-om the ROC extraction part 

20 150 becomes an image of the character region in which the backgroxmd region is 
removed fi-om the input image. Accordingly, the image output firom the ROC extraction 
part 150 becomes the image of the character region in which an unwanted outside 
region is removed from the input image. 

25 Thereafter, in step 321, the image of the extracted character region is extended 

to the size of the input image. The image extension can be implemented by 
interpolation. It is assumed herein that the image extension is implemented by the 
bilinear interpolation defined in Equation (4). The image extension is achieved by the 
interpolation operation so that the size of the image of the extracted character region can 

30 be equal to that of the input image. 

In step 323, the extended image is output to the character recognition part 170, 
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and the character recognition part 170 recognizes characters from the extended image. 
When occasion demands, the extended image can be used in an image processing device 
that edits an image and saves the edited image. 

5 FIG 6 is a flowchart illustrating a method for extending a character region in 

an image according to a second embodiment of the present invention. Comparing FIG 6 
(and FIG 2) with FIG 5 (and FIG 1), the character region extension method show in 
FIG 6 further includes the process of mean-filtering an image (using the mean filter part 
180 shown in FIG 2) before block classification, and subsampling (subsampler part 190 
10 in FIG 2) and interpolation (interpolation part 195 in FIG 2) processes before and after 
median filtering, in addition to the character region extension method shown in FIG 1 . 
The processes of FIG 6 other than these immediately identified above processes are 
identical in operation to their corresponding processes shown in FIGs. 5. 

15 Referring to FIG 6, in decision step 311, the input part 110 receives an image 

having a size of N x M pixels ("Yes" path from decision step 311). As discussed above, 
it is assumed herein that the image has a size of 640 (N) x 480 (M) pixels. The input 
image can be a color image or grayscale image not having color information. In the 
second embodiment of the present invention, it is assumed that the image is a grayscale 

20 image. Thereafter, in step 312, the image is subject to mean filtering to make a blurred 
image, so that the background region outside the character region of the image does not 
affect the character region classification process. 

Thereafter, in step 313, the mean-filtered image is divided into blocks having a 
25 preset size, pixels contained in the blocks are analyzed, the blocks are classified into 
character blocks and background blocks, and pixels of the classified character blocks 
are converted into pixels having a specified value. 

FIG 7 is a flowchart illustrating a detailed procedure of the block classification 
30 process of step 313. Referring to FIG 7, in step 411, the input image is divided into 
blocks having a predetermined size. Here, the image consists of 640 x 480 pixels, and 
each of the blocks consists of 8 x 8 pixels. In this case, the image is divided into 4800 
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blocks. 



Thereafter, a block number BN is set to 0 in step 413, and a block with the 
block number BN is accessed in step 415. In step 417, the accessed block is subject to 
5 DCT conversion. In step 419, the sum of the absolute values of dominant DCT 
coefficients within the DCT-converted block #BN is calculated using Equation (1), and 
then saved. In this case, the energy distribution value of the DCT coefficients within the 
character blocks is larger than that of DCT coefficients within the background blocks. 
Energy distributions of the DCT coefficients for the character blocks and the 

10 background blocks show the characteristic illustrated in FIG 4A. Further, an energy 
distribution characteristic of DCT coefficients for the character blocks shows the 
characteristic illustrated in FIG 4B. Therefore, the sum S*^ of absolute values of the 
DCT coefficients in the k^^ block can be calculated in accordance with Equation (1). 
Here, 'k' is the same parameter as BN, and denotes a block number. After the is 

15 saved in step 419, it is determined in decision step 421 whether of the last block is 
calculated. If of the last block is not calculated yet ("No" path from decision step 
421), the procedure increases the block number by one in step 423, and then returns to 
step 415 to repeat the above operation. 

20 Through repetition of the steps 415 to 423, the respective blocks are subject to 

DCT conversion and the calculation of Equation (1) is performed on all blocks (at k = 0, 
1,2, ... , 4799). In step 425 (which proceeds from the "Yes" path of decision step 421), 
a threshold Cth is calculated using energy values 5* (k = 0, 1, 2, ... , 4799) calculated 
block by block. The energy values (k = 0, 1, 2, . , 4799) calculated block by block 

25 are summed, and an average (5*) is produced by dividing the summed energy value by 
the total number TBN of blocks. The average value <5'*> is produced in accordance 
with Equation (2). The average value (S^) becomes a block threshold Cth used for 
determining the blocks as character blocks or background blocks. 

30 After the threshold Cth is calculated, an operation of classifying the blocks into 

character blocks and background blocks is performed. For that purpose, a block number 
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BN is initialized to '0' in step 417, and S*^ of a block with the block number BN is 
accessed in step 429. Thereafter, in decision step 431, the classification part 219 
classifies a corresponding block as a character block or a background block by 
comparing of the block with the block threshold Cth , The classification part 219 
5 classifies the k^^ block as a character block if > Cth ("Yes" path from decision step 
431) and classifies the k'^ block as a background block if 5* <Cth as shown in 
Equation (3) ("No" path firom decision step 43 1). ^ 

Pixels in the classified character blocks can have gray levels between 0 and 
10 255. In the embodiments of the present invention, since only the character region is 
extracted from the image, it is necessary to definitely distinguish the character region 
fi-om the background region in the block classification process. Therefore, if a 
corresponding block is classified as a character block in step 433, pixels in the classified 
character block are converted into pixels having a first brightness value in step 435. 
1 5 Otherwise, if a corresponding block is classified as a background block in step 437, 
pixels in the classified background block are converted into pixels having a second 
brightness value in step 439. In the embodiments of the present invention, it is assumed 
that the pixels in the character block are converted into white pixels, while the pixels in 
the background block are converted into black pixels. Thus, by filling the character 
20 blocks with the white pixels and filling the background blocks with the black pixels in 
the block classification process, the image is definitely distinguished into character 
blocks and background blocks. 

After determining whether a block number (BN) is a character block or a 
25 backgroimd block, and the pixels therein are converted into pixels with a corresponding 
brightness value through steps 429 to 439, it is determined in decision step 441 whether 
the classified block is the last block. If the (BN) is not the last block ^TMo" path fi"om 
decision step 441), the procedure increases the block number by one in step 443, and 
then returns to step 429 to repeat the above operation. When the above operation is 
30 completely performed ("Yes" path firom decision step 441), the block classification 
results are output. After the image is divided into the blocks, an operation of classifying 
the blocks into character blocks and background blocks and correcting brightness values 
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of pixels in the classified blocks is performed. 

When the block classification operation of FIG 7 is performed in step 313 of 
FIG 6, the image is classified into character blocks and background blocks. Further, 
5 pixels in the classified character blocks are converted into white pixels, while pixels in 
the background character blocks are converted into black pixels. In this manner, pixels 
in the classified blocks of the image are corrected into white pixels or black pixels. 

Thereafter, in step 314, the image is subject to subsampling to reduce the 
10 number of horizontal and vertical pixels. The subsampling is performed to increase the 
filtering rate by decreasing the filter window in the following median filtering process. 
Assuming that the subsampling ratio is (2 : 1)^ , the horizontal and vertical pixels of the 
image are subject to 2:1 subsampling, so that the number of pixels is reduced to Va of the 
original value. In this case, the size of the reduced image is 320 x 240 pixels. After the 
15 subsampling is performed, the reduced image is subject to median filtering in step 315. 
The median filtering is performed to remove the isolated blocks left in the image as they 
are erroneously classified due to the edges or noise of the image. After the erroneously 
classified character blocks are removed by performing the median filtering, the 
horizontal and vertical pixels of the median-filtered image are subject to interpolation to 
20 extend the image to a size of the input image in step 316. 

Thereafter, in step 317, the interpolated image whose size is extended to its 
original size is horizontally and vertically scanned to search for a position of a character 
region. The position search part 140 horizontally scans the median-filtered image, 

25 searches for a point xl at the leftmost character block and a point x2 at the rightmost 
character block, and saves a result of the search. Furthermore, the position search part 
140 vertically scans the median-filtered image, searches for a point yl at the topmost 
character block and a point y2 at the bottommost character block, and stores a result of 
the search. The left top and right bottom points (xl, yl) and (x2, y2) of the character 

30 region depend upon the results of the searches. The left top and right bottom points (xl, 
yl) and (x2, y2) of the character region are determined based on the aspect ratio of the 
input image, such that the distortion of the image can be substantially reduced or 
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eliminated when the following ROC extension part 160 extends the image. In the 
embodiments of the present invention, the left top and right bottom points (xl, yl) and 
(x2, yl) of the character region are determined so that the ratio of width to length 
associated with the character region searched by the position search part 140 becomes 
5 4:3 since the ratio of width to length associated with the input image is also 4:3 (i.e., 
640:480 pixels).. 

FIG 8 is a flowchart illustrating a detailed procedure of the position search 
process of step 317. Referring to FIG 8, the median-filtered image is received in step 

10 511, and a horizontal scan parameter HSN and a vertical scan parameter VSN are both 
initialized to '0' in step 513. Thereafter, a position with the HSV is scanned in step 515, 
and it is determined in decision step 517 whether the scanned position with the HSN is 
in a character region. If the scanned position with the HSN is in a character region 
("Yes" path from decision step 517), an x coordinate value of the HSN is saved in step 

15 519. Thereafter, it is determined in decision step 521 whether the HSN is a value of the 
last horizontal (or rightmost) scan position. If the HSN is not a value of the last 
horizontal scan position ("No" path from decision step 521), the procedure determines 
the next horizontal scan position in step 523, and then retums to step 515 to repeat the 
above operation (steps 511-521). If horizontal scanning is completely performed on up 

20 to the last HSN horizontal scan position through repetition of the above operation, the 
completed horizontal scanning is detected in step 521. Thereafter, in step 525, a 
coordinate value xl of a left position and a coordinate value x2 of a right position, 
scanned as a character region, are determined and saved. 

25 A position with the VSN is scanned in step 527, and it is determined in 

decision step 529 whether the scanned position with the VSN is in a character region. If 
the scanned position with the VSN is in a character region ("Yes" path from decision 
step 529), a y coordinate value of the VSN is saved in step 531. Thereafter, it is 
determined in decision step 533 whether the VSN is a value of the last vertical (or 

30 bottommost) scan position. If the VSN is not a value of the last vertical scan position 
("No" path from decision step 533), the procedure determines the next vertical scan 
position in step 535, and then retums to step 527 to repeat the above operation. If 
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vertical scanning is completely performed on up to the last VSN vertical position 
through repetition of the above operation (steps 527-531), the completed vertical 
scanning is detected in decision step 533. Thereafter, in step 537, a coordinate value yl 
of an upper position and a coordinate value y2 of a lower position, scanned as a 
5 character region, are determined and saved. 

Thereafter, in step 539, left top and right bottom points (xl, yl) and (x2, y2) of 
the character region in the image are determined according to the search results. The left 
top and right bottom points (xl, yl) and (x2, y2) of the character region are determined 

10 based on an aspect ratio of the input image, so that distortion of the image can be 
substantially reduced or prevented when the ROC extension part 160 extends the image. 
In this embodiment of the present invention, the left top and right bottom points (xl, yl) 
and (x2, y2) of the character region are determined so that the ratio of width to liength 
associated with the character region searched by the position search part 140 is 4:3, 

15 since the ratio of width to length associated with the input image is also 4:3 (i.e., 
640:480 pixels).. Therefore, if the determined positions of the character region are 
inconsistent with the aspect ratio of the input image, the positions of the character 
region are changed so that they are coincident with an aspect ratio of the input image. 

20 In the above position search method, an initial character region is searched 

through horizontal scanning firom the left to the right, and its position is saved as a value 
xl. Thereafter, the last character region is searched through horizontal scanning from 
the right to the left, and its position is saved as a value x2. Similarly, the initial character 
region is searched through vertical scanning from the top to the bottom using the same 

25 method, and its position is saved as a value yl. Thereafter, the last character region is 
searched through vertical scanning from the bottom to the top, and its position is saved 
as a value y2. In this manner, a position of the character region can be searched. 

After the position of a character region is searched in step 317 of FIG. 6 
30 through the procedure of FIG 8, an image corresponding to the position of the searched 
character region is extracted in step 319. An image located between the left top and 
right bottom points (xl, yl) and (x2, y2) of the character region is extracted from the 
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input image as the character region. Pixels in the extracted character region constitute an 
image existing between the point xl and the point x2 in the horizontal direction and 
between the point yl eind the point y2 in the vertical direction in the input image. The 
pixels of the character region constitute an image of the character region in which the 
5 background region is removed from the input image. 

After step 319, the image of the extracted character region is extended to a size 
of the input image in step 321. The image extension can be implemented by 
interpolation. In this embodiment of the present invention, it is assumed that the image 
10 extension is implemented by the bilinear interpolation defined as Equation (4). In step 
323, the image of the extended character region can be output to the recognition part 
170 or saved for other uses. 

FIG 9 is a flowchart illustrating a procedure for extending the character region 
15 in an image according to a further embodiment of the present invention. FIGs. lOA to 
lOH are diagrams illustrating images generated in the procedure of FIG 9. 

A device for extending a character region in an image according to an 
embodiment of the present invention will now be described with reference to FIGs. 9 

20 and lOA to lOH. In step 600, the input part 110 receives an input image shown in FIG 
lOA. It is assumed herein that the input image is comprised of 640 (column) x 480 
(row) pixels and can be a grayscale image not having color information. In this 
embodiment of the present invention, it is assumed that the image is a grayscale image. 
Thereafter, in step 610, the mean filter 180 performs mean filtering on the input image 

25 of FIG lOA, and generates a blurred image shown in FIG lOB, so that the background 
region outside the character region of the image does not affect the character region 
classification process. 

Thereafter, in step 620, the block classification part 120 divides the mean- 
30 filtered image into blocks having a predetermined size, analyzes pixels included in the 
divided blocks, classifies the blocks into character blocks and background blocks, and 
converts the pixels in the character blocks into pixels having a specified value. Through 
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the block classification, the image is classified into character blocks and background 
blocks, and pixels in the character blocks are converted to white pixels, and pixels in the 
background blocks are converted into black pixels. Thus, the image is filled with white 
or black pixels according to classified blocks. The image generated by the block 
5 classification part 120 is shown in FIG IOC. 

If the block-classified image of FIG. IOC is generated in step 620, the 
subsampling part 190 subsamples the block-classified image of FIG IOC in step 630, 
and generates the image of FIG lOD, in which the number of vertical and horizontal 

10 pixels are reduced. Subsampling is performed to increase the filtering rate by decreasing 
the filter window in the following median filtering process. FIG lOD shows an image 
subsampled at a subsampling ratio of (2:1)^. After the subsampling, the median filter 
130 performs median filtering in step 640 on the subsampled image. The median 
filtering is performed to remove isolated character blocks erroneously classified as 

15 character blocks due to the edges or noise of the input image. The median-filtered image 
is shown in FIG lOE. After the erroneously classified character blocks are removed 
through median filtering, the interpolation part 195 performs an interpolation in step 
650 on the horizontal and vertical pixels in the median-filtered image of FIG lOE. By 
performing the interpolation of step 650, the size of the image is extended to that of the 

20 input image as shown in FIG lOF. 

In step 660, position search part 140 horizontally and vertically scans the 
interpolated image of FIG lOF, and searches for a position of the character region. The 
position search part 140 horizontally scans the median-filtered image and searches for a 

25 point xl at the leftmost character block and a point x2 at the rightmost character block. 
Furthermore, the position search part 140 vertically scans the median-filtered image, 
and searches for a point yl at the topmost character block and a point y2 at the 
bottommost character block. Thereafter, in step 670, the position search part 140 
determines left top and right bottom points (xl, yl) and (x2, y2) of the character region 

30 in the image according to the results of the search. The left top and right bottom points 
(xl, yl) and (x2, y2) of the character region are determined based on the aspect ratio of 
the input image, such that the distortion of the image can be substantially reduced or 
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prevented when the following ROC extension part 160 extends the image. 

After the position of the character region is searched, the ROC extraction part 
150 extracts an image existing in the searched position of the character region from the 
5 input image of FIG lOA in step 680. The character region is extracted by extracting the 
image existing between left top and right bottom points (xl, yl) and (x2, y2) of the 
character region from the image of FIG lOA, and the extracted image is shown in FIG. 
lOG The extracted image of the character region, shown in FIG lOQ is located between 
the point xl and the point x2 in the horizontal direction and between the point yl and 
10 the point y2 in the vertical direction in the input image. The image of the character 
region becomes the image of the character region in which a background region is 
removed from the input image. 

After the image of the character region is extracted, the ROC extension part 
15 160 extends, in step 690, the image of the character region, shown in FIG lOQ to a size 
of the input image as shown in FIG lOH. The image extension can be implemented by 
interpolation. In the embodiments of the present invention, the image extension can be 
implemented by bilinear interpolation. In step 700, the extended image of FIG lOH is 
output to the recognition part 170, or can be used for other purposes. 

20 

In the new image signal preprocessing operation as described above, the 
position of the character region in an input image is searched, the image of the searched 
character region is extracted, and the extracted image of the character region is extended 
to the size of the input image, so that only the character region is subject to character 
25 recognition, contributing to an improvement in recognition performance. In addition, 
the image is classified into character regions and background regions, and the blocks 
erroneously classified as character regions are removed to thereby improve search 
performance of a character region. 

30 While the invention has been shown and described with reference to a certain 

preferred embodiment thereof, it will be understood by those skilled in the art that 
various changes in form and details may be made therein without departing from the 
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spirit and scope of the invention as defined by the appended claims. 
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